SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT :\Thastrup, Ole 
illin, Soren 

.sen, Lars Kongsbak 
B^om, Sara Petersen 

(ii) TITLE OF INVEJrriON: Novel Fluorescent Proteins 

(iii) NUMBER OF SEQU^iNCES : 20 

(iv) CORRESPONDENCE AiDDRESS : 

(A) ADDRESSEE: \novo Nordisk of North America, Inc. 

(B) STREET: 4 0^ Lexington Avenue, Suite 6400 

(C) CITY: New Y&rk 

(D) STATE: New Yt>rk 

(E) COUNTRY: U. 

(F) ZIP: 10174-64X)1 



(v) 



COMPUTER READABLE 

(A) MEDIUM TYPE: Fl^^py disk 

(B) COMPUTER: IBM PC\ compatible 

(C) OPERATING SYSTEmX PC-DOS /MS -DOS 

(D) SOFTWARE: Patentllj Release #1.0, 



Version #1.25 



(vi) 



CURRENT APPLICATION DAT) 

(A) APPLICATION NUMBER; 

(B) FILING DATE: 17- 

(C) CLASSIFICATION: 



)8/819, 612 
-1997 




(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Gregg, Valeta A. 

(B) REGISTRATION NUMBER: 35\127. 

(C) REFERENCE/DOCKET NUMBER :\ 45 94 . 2 04 -US 

(ix) TELECOMMUNICATION INFORMATION:' 

(A) TELEPHONE: 212-867-0123 

(B) TELEFAX: 212-878-9655 



(2) INFORMATION FOR SEQ ID NO : 1 : 



(i) 



(ii) 
(xi) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 
MOLECULE TYPE: cDNA 

SEQUENCE DESCRIPTION: SEQ ID NO : 1 



TGGAATAAGC TTTATGAGTA AAGGAGAAGA ACTTTT 



36 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 
AAGAATTCGG ATCCCTTTAG TGTCAATTGG AAGTCT 



36 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 67 nucleotides 



^. N 



(B) TYPE: nucleic acid 
iC\ STRANDEDNESS : single 
(D)\ TOPOLOGY : linear 
(ii) MOLECSrULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

CTACCTGTTC CATG<SCCAAC GCTTGTCACT ACTTTCCTCA TGGTGTTCAA TGCTTTTCTA 60 
GATACCC \ 6 7 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 nucleotides 

(B) TYPE: Vucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY^: linear 
MOLECULE TYPE\ cDNA 

SEQUENCE DESCRIPTION: SEQ ID NO : 4 



(ii) 
(xi) 



AAGAATTCGG ATCCCTTTAG TGO 



^TTGG AAGTCT 



36 



(2) 



INFORMATION FOR SEQ ID 
(i) SEQUENCE CHARACTERI5 



(ii) 
(xi) 



:5 : 

SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 0 nucleotides 

(B) TYPE: nucleic a 

(C) STRANDEDNESS: siAfile 

(D) TOPOLOGY: linear 
MOLECULE TYPE: cDNA 

SEQUENCE DESCRIPTION: SfiQ ID NO : 5 : 



AATTGGTACC AAGGAGGTAA GCTTTATGAG 



30 



(2) INFORMATION FOR SEQ ID NO : 6 : 



1/ 



(i) 



(ii) 
(xi) 



SEQXJENCE CHARACTERISTICS : 

(A) LENGTH: 3 0 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
MOLECULE TYPE: cDNA 
SEQUENCE DESCRIPTION: SEQ ID NO :> 



CTTTCGTTTT GAATTCGGAT CCCTTTAGTG 



30 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 8 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 



CATGGCCAAC GCTTGTCACT ACTCTCTCTT ATGGTGTTCA ATGCTTllT 



48 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 8 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 



CTAGAAAAGC ATTGAACACC ATAAGAGAGA GTAGTGACAA GCGTTGGC 



48 




(2) INFORMATION FOR SEQ ID NO : 9 : 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
.(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
CATGGCCAAC GOTTGTCACT ACTCTCACTT ATGGTGTTCA ATGCTTTT 



(2) INFORMATIONXFOR SEQ ID NO: 10: 
(i) SEQUENCte CHARACTERISTICS: 

(A) LENGTH: 48 nucleotides 

(B) TYPES: nucleic acid 

(C) STRANCEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYMl : cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CTAGAAAAGC ATTGAACACC AT^AAGTGAGA GTAGTGACAA GCGTTGGC 



(2) INFORMATION FOR SEQ ID\NO:ll: 
(i) SEQUENCE CHARAGTErXsTICS : 

(A) LENGTH: 36 nuc\eotides 

(B) TYPE: nucleic accid 

(C) STRANDEDNESS: si\gle 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: S^ ID NO: 11: 
TGGAATAAGC TTTATGAGTA AAGGAGAAGA AOTTTT 



(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12^ 

GAATCGTAGA TCTTTATTTG TATAGTTCAT CCATG 



(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 



13 



TGGAATAAGC TTTATGGATC CCGTCGTTTT ACAACGTCGT 



(2) INFORMATION FOR SEQ ID NO : 14 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



GCGfcGAATTC TTATTATTAT TTTTGACACC AGAC 



34 



(2) I^^FORMATION FOR SEQ ID NO: 15: 
{h SEQUENCE CHARACTERISTICS: 
\ (A) LENGTH: 764 nucleotides 
\ (B) TYPE: nucleic acid 
(C) STRANDEDNESS : single 
AD) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



AAGCTTT ATG 
Met 
1 



AAA GGA GAA GAA CTT TTC ACT GGA GTT GTC 
Lys Gly Glu Glu Leu Phe Thr Gly Val Val 
5 10 



CCA ATT CTT 
Pro lie Leu 
15 



52 



GTT GAA TTA GAT 
Val Glu Leu Asp 



5GC GAT GTT AAT GGG CAA AAA TTC TCC GTT 
Ly Asp Val Asn Gly Gin Lys Phe Ser Val 
2^ 25 



AGT GGA GAG 
Ser Gly Glu 
30 



103 



GGT GAA GGT GAT GCA ^CA TAC GGA AAA CTT ACC CTT AAA TTT 
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe 
35 \ 40 45 



ATT TGC ACT 
He Cys Thr 



154 



ACT GGG AAG CTA CCT GTT \CCA TGG CCA ACG CTT GTC ACT ACT 
Thr Gly Lys Leu Pro Val ^ro Trp Pro Thr Leu Val Thr Thr 
50 55 \ 60 



CTC TCT CAT 
Leu Ser His 
65 



205 



GGT GTT CAA TGC TTT TCT AGAXTAC CCA GAT CAT ATG AAA CAG 
Gly Val Gin Cys Phe Ser Arg Vyr Pro Asp His Met Lys Gin 
70 \ 7.5 80 



CAT GAC TTT 
His Asp Phe 



256 



TTC AAG AGT GCC ATG CCC GAA GGT\TAT GTA CAG GAA AGA ACT 
Phe Lys Ser Ala Met Pro Glu Gly Vyr Val Gin Glu Arg Thr 
85 90 \ 95 



ATA TTT TAC 
He Phe Tyr 
100 



307 



/ 



AAA GAT GAC GGG AAC TAC AAG ACA CGT\GCT GAA GTC AAG TTT 
Lys Asp Asp Gly Asn Tyr Lys Thr Arg Via Glu Val Lys Phe 
105 V%0 

ACC CTT GTT AAT AGA ATC GAG TTA AAA GGT\ATT GAT TTT AAA 
Thr Leu Val Asn Arg He Glu Leu Lys Gly \le Asp Phe Lys 
120 125 \ 130 



GAA GGT GAT 3 58 

Glu Gly Asp 

115 

GAA GAT GGA 409 

Glu Asp Gly 



AAC ATT CTT GGA CAC AAA ATG GAA TAC AAT TAA AAC TCA CAT 
Asn He Leu Gly His Lys Met Glu Tyr Asn Tyr ^sn Ser His 
135 140 145 



AAT GTA TAC 
Asn Val Tyr 
150 



460 



ATC ATG GCA GAC AAA CCA AAG AAT GGC ATC AAA GTT. AAC TTC 
He Met Ala Asp Lys Pro Lys Asn Gly He Lys ValNAsn Phe 
155 160 \ 165 



AAA ATT AGA 
Lys He Arg 



511 



CAC AAC ATT AAA GAT GGA AGC GTT CAA TTA GCA GAC COT TAT 
His Asn He Lys Asp Gly Ser Val Gin Leu Ala Asp Hi ^ Tyr 
170 175 180 



CAA CAA AAT 
Gin Gin Asn 
185 



562 



ACT CCA ATT GGC GAT GGC CCT GTC CTT TTA CCA GAC AAC 
Thr Pro He Gly Asp Gly Pro Val Leu Leu Pro Asp Asn 
190 195 



TAC CTG TCC 
Tyr Leu Ser 
200 



613 



ACG CAA TCT GCC CTT TCC AAA GAT CCC AAC GAA AAG AGA GAT 
Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp 
205 210 215 



{CAC ATG ATC 
lis Met He 



664 



CTT CTT GAG TTT GTA ACA GCT GCT GGG ATT ACA CAT GGC ATG 
Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met 
220 225 230 



GAT GAA CTA 
As®. Glu Leu 
235 



715 



TAC AAA TAA ATGTCCAGAC TTCCAATTGA CACTAAAGGG ATCCGAATTC 



764 



Tyr Lys 



(2) INFORMATION FOR SEQ ID NO: 16: 

(Vi) SEQUENCE CHARACTERISTICS: 

\(A) LENGTH: 238 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l6: 



Met Ser Lys G\y Glu Glu Leu Phe Thr Gly Val Val 
5 10 



Pro lie Leu Val 
15 



Glu Leu Asp Gly i\sp Val Asn Gly Gin Lys Phe Ser 
20 \ 25 



Val Ser Gly Glu 
30 



Gly Glu Gly Asp Ala\Thr Tyr Gly Lys Leu Thr Leu 
35 \ 40 



Lys Phe lie Cys 
45 



Thr Thr Gly Lys Leu Prdv Val Pro Trp Pro Thr Leu Val Thr Thr Leu 
50 \ 55 60 



Ser His Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp 
65 70 \ 75 



His Met Lys Gin 

" 80 



His Asp Phe Phe Lys Ser Ala pfet Pro Glu Gly Tyr 
85 \ 90 



Val Gin Glu Arg 
95 



Thr lie Phe Tyr Lys Asp Asp Gly>Asn Tyr Lys Thr 
100 3Sp5 



Arg Ala Glu Val 
110 



Lys Phe Glu Gly Asp Thr Leu Val Asitv Arg lie Glu 
115 120 



Leu Lys Gly lie 
125 



Asp Phe Lys Glu Asp Gly Asn lie Leu 
130 135 



Gf^ 



His Lys 
140 



Met Glu Tyr Asn 




r Asn Ser His Asn Val Tyr lie Met Ala Asp Lys 
45 150 15^ 

lie Lys Val Asn Phe Lys lie Arg His Asn lie \ys 
165 170 



Pro Lys Asn Gly 
160 

Asp Gly Ser Val 
175 



Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro .lie 
180 185 



s^ly Asp Gly Pro 
190 



Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin 
195 200 

Lys Asp Pro Asn Glu Lys Arg Asp His Met lie Leu 

210 215 220 



SeV Ala Leu Ser 
205> 

Leu dSLu Phe Val 



Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 764 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

AAGCTTT ATG AGT AAA GGA GAA GAA CTT TTC ACT GGA GTT GTC CCA ATT 
Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He 
1 5 10 



49 



CTT GTT GAA TTA GAT GGC GAT GTT AAT GGG CAA AAA TTC 
Leu Val Glu Leu Asp Gly Asp Val Asn Gly Gin Lys Phe 
15 \ 20 25 



TCT GTT AGT 
Ser Val Ser 
30 



97 



GGA GAS GGT GAA GGT GAT GCA ACA TAG GGA AAA CTT AGO 
Gly Gl\\Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr 
35 40 



CTT AAA TTT 
Leu Lys Phe 
45 



145 



ATT TGC ACT ACT GGG AAG CTA CCT GTT CCA TGG CCA ACG 
lie Cys Thrv^Thr Gly Lys Leu Pro Val Pro Trp Pro Thr 
50 55 



CTT GTC ACT 
Leu Val Thr 
60 



193 



ACT CTC TCT TAT GGT GTT CAA TGC TTT TCT AGA TAC CCA 
Thr Leu Ser Tyi^ Gly Val Gin Cys Phe Ser Arg Tyr Pro 
65 \ 70 75 



GAT CAT ATG 
Asp His Met 



241 



AAA CAG CAT GAC Tl 
Lys Gin His Asp 
80 



TTC AAG AGT GCC ATG CCC GAA GGT 
Phe Lys Ser Ala Met Pro Glu Gly 
85 90 



TAT GTA CAG 
Tyr Val Gin 



289 



GAA AGA ACT ATA TTT 
Glu Arg Thr lie Phe 
95 IOC 



AAA GAT GAC GGG AAC TAC AAG 
Lys Asp Asp Gly Asn Tyr Lys 
105 



ACA CGT GCT 
Thr Arg Ala 
110 



337 



GAA GTC AAG TTT GAA GGT 
Glu Val Lys Phe Glu Gly 
115 



VT ACQ CTT GTT AAT AGA ATC 
Thr Leu Val Asn Arg lie 
120 



GAG TTA AAA 
Glu Leu Lys 
125 



385 



GGT ATT GAT TTT AAA GAA GAT ^S^GA AAC ATT CTT GGA CAC 
Gly lie Asp Phe Lys Glu Asp gW Asn lie Leu Gly His 
130 \ 135 



AAA ATG GAA 
Lys Met Glu 
140 



433 




TAC AAT TAT AAC TCA CAT AAT GTA TAC ATC ATG GCA 
Tyr Asn Tyr Asn Ser His Asn Val xSfr lie Met Ala 
145 150 



AAT GGC ATC AAA GTT AAC TTC AAA ATT ^XGA CAC AAC 
^sn Gly lie Lys Val Asn Phe Lys lie Arg His Asn 
160 165 \ 170 



AGC GTT CAA TTA GCA GAC CAT TAT CAA CAA >AAT ACT 
Ser Val Gin Leu Ala Asp His Tyr Gin Gin A^n Thr 
175 180 it 



GAC 
Asp 
155 

ATT 
lie 



AAA CCA AAG 
Lys Pro Lys 



AAA GAT GGA 
Lys Asp Gly 



CCA ATT GGC GAT 
Pro lie Gly Asp 
190 



481 



529 



577 



GGC CCT GTC CTT TTA CCA GAC AAC CAT TAC CTGNtCC 
Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu ^er 
195 200 



ACG CAA TCT GCC 
Thr Gin Ser Ala 
205 



625 



CTT TCC AAA GAT CCC AAC GAA AAG AGA GAT CAC AT< 
Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 
210 215 



,ATC CTT CTT GAG 
:le Leu Leu Glu 
220 



673 



TTT GTA ACA GCT GCT GGG ATT ACA CAT GGC ATG GAT 
Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp 
225 230 



GAA CTA TAC AAA 
Glu\Leu Tyr Lys 
235 



721 



TAA ATGTCCAGAC TTCCAATTGA CACTAAAGGG ATCCGAATTC 



764 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie 
5 10 



Leu Val Glu Leu Asp Gly Asp 
15 \ 20 

Gly Glu (^ly Glu Gly Asp Ala 
35 

lie Cys Th\ Thr Gly Lys Leu 
50 

Thr Leu Ser Gly Val Gin 

65 

Lys Gin His Asp ^he Phe Lys 
80 \ 85 



Val Asn Gly Gin Lys Phe 
25 

Thr Tyr Gly Lys Leu Thr 
40 

Pro Val Pro Trp Pro Thr 
55 

Cys Phe Ser Arg Tyr Pro 
70 75 

Ser Ala Met Pro Glu Gly 
90 



Ser Val Ser 
30 

Leu Lys Phe 
45 

Leu Val Thr 
60 

Asp His Met 
Tyr Val Gin 



Glu Arg Thr lie Phe. Tyr Lys 
95 \l00 



Asp Asp Gly Asn Tyr Lys 
105 



Thr Arg Ala 
110 



Glu Val Lys Phe Glu G^y Asp 
115 



Thr Leu Val Asn Arg lie 
120 



Glu Leu Lys 
125 



Gly lie Asp Phe Lys GluN^sp 
130 



Gly Asn lie Leu Gly His 
135 



Lys Met Glu 
140 



Tyr Asn Tyr Asn Ser His As? 
14 5 

Asn Gly lie Lys Val Asn Phe 

160 165 

Ser Val Gin Leu Ala Asp His 
175 180 



Gly Pro Val Leu Leu Pro Asp 
195 



Leu Ser Lys Asp Pro Asn Glu 
210 



Val Tyr lie Met Ala Asp. 
150 155 

rs lie Arg His Asn lie 
170 

. Tyi\Gln Gin Asn Thr Pro 
185 

Asn Hiy^ Tyr Leu Ser Thr 
200 

Lys Arg \sp His Met lie 
215 



Lys Pro Lys 
Lys Asp Gly 



lie Gly Asp 
190 

Gin Ser Ala 
205 

Leu Leu Glu 
220 



Phe Val Thr Ala Ala Gly lie 
225 



Thr His GlAMet Asp Glu Leu Tyr Lys 
230 \ 235 



(2) INFORMATION FOR SEQ ID NO:19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 764 nucleotides 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19-: 



AAGCTTT 


ATG 


AGT 


AAA 


GGA 


GAA 


GAA 


CTT 


TTC 


ACT 


GGA 


G'S;t 


GTC 


CCA 


ATT 






Met 


Ser 


Lys 


Gly 


Glu 


Glu 


Leu 


Phe 


Thr 


Gly 


Va\ 


Val 


Pro 


He 


CTT 


GTT 


GAA 


TTA 


GAT 


GGC 


GAT 


GTT 


AAT 


GGG 


CAA 


AAA 


TTC 


\tct 


GTT 


AGT 


Leu 


Val 


Glu 


Leu 


Asp 


Gly 


Asp 


Val 


Asn 


Gly 


Gin 


Lys 


Phe 


^er 


Val 


Ser 


GGA 


GAG 


GGT 


GAA 


GGT 


GAT 


GCA 


ACA 


TAC 


GGA 


AAA 


CTT 


ACC 


CTT 


AAA 


TTT 


Gly 


Glu 


Gly 


Glu 


Gly 


Asp 


Ala 


Thr 


Tyr 


Gly 


Lys 


Leu 


Thr 


Lea 


Lys 


Phe 


ATT 


TGC 


ACT 


ACT 


GGG 


AAG 


CTA 


CCT 


GTT 


CCA 


TGG 


CCA 


ACG 


CTT 


\gtc 


ACT 


He 


Cys 


Thr 


Thr 


Gly 


Lys 


Leu 


Pro 


Val 


Pro 


Trp 


Pro 


Thr 


Leu 


Val 


Thr 


ACT 


CTC 


ACT 


TAT 


GGT 


GTT 


CAA 


TGC 


TTT 


TCT 


AGA 


TAC 


CCA 


GAT 


cAt 


ATG 


Thr 


Leu 


Thr 


Tyr 


Gly 


Val 


Gin 


Cys 


Phe 


Ser 


Arg 


Tyr 


Pro 


Asp 




Met 


AAA 


CAG 


CAT 


GAC 


TTT 


TTC 


AAG 


AGT 


GCC 


ATG 


CCC 


GAA 


GGT 


TAT 


GTA 


VCAG 


Lys 


Gin 


His 


Asp 


Phe 


Phe 


Lys 


Ser 


Ala 


Met 


Pro 


Glu 


Gly 


Tyr 


Val 


Sin 



49 



97 



145 



193 



241 



289 



GAA 
Glu 


AOA 


ACT 
Thr 


ATA 
He 


TTT 
Phe 


TAC 
Tyr 


AAA 
Lys 


GAT 
Asp 


GAC 
Asp 


GGG 
Gly 


AAC 
Asn 


TAC 
Tyr 


AAG 
Lys 


ACA 
Thr 


CGT 
Arg 


GCT 
Ala 


GAA 
Glu 


GTCV 
Val 


AAG 
\Lys 


TTT 
Phe 


GAA 
Glu 


GGT 
Gly 


GAT 
Asp 


ACC 
Thr 


CTT 
Leu 


GTT 
Val 


AAT 
Asn 


AGA 
Arg 


ATC 
He 


GAG 
Glu 


TTA 
Leu 


AAA 
Lys 


GGT 
Gly 


ATT 
lie 


G^T 
Adv 


TTT 
Phe 


AAA 
Lys 


GAA 
Glu 


GAT 
Asp 


GGA 
Gly 


AAC 
Asn 


ATT 
He 


CTT 
Leu 


GGA 
Gly 


CAC 
His 


AAA 
Lys 


ATG 
Met 


GAA 
Glu 


TAG 
Tyr 


AAT 
Asn 


TAT^ 
Tyr 


SAAC 
Asn 


TCA 
Ser 


CAT 
His 


AAT 
Asn 


GTA 
Val 


TAC 
Tyr 


ATC 
He 


ATG 
Met 


GCA 
Ala 


GAC 
Asp 


AAA 
Lys 


CCA 
Pro 


AAG 
Lys 


AAT 
Asn 


GGC 
Gly 


ATC 
He 


Lyls 


GTT 
Val 


AAC 
Asn 


TTC 
Phe 


AAA 
Lys 


ATT 
He 


AGA 
Arg 


CAC 
His 


AAC 
Asn 


ATT 
He 


AAA 
Lys 


GAT 
Asp 


GGA 
Gly 


AGC 
Ser 


GTT 
Val 


CAA 
Gin 


TTA 
Leu 


ATa 


GAC 
Asp 


CAT 
His 


TAT 
Tyr 


CAA 
Gin 


CAA 
Gin 


AAT 
Asn 


ACT 
Thr 


CCA 
Pro 


ATT 
He 


GGC 
Gly 


GAT 
Asp 


GGC 
Gly 


CCT 
Pro 


GTC 
Val 


CTT 
Leu 


TTA 
Leu 


\^CA 
Fro 


GAC 
Asp 


AAC 
Asn 


CAT 
His 


TAC 
Tyr 


CTG 
Leu 


TCC 
Ser 


ACG 
Thr 


CAA 
Gin 


TCT 
Ser 


GCC 
Ala 


CTT 
Leu 


TCC 
Ser 


AAA 
Lys 


GAT 
Asp 


CCC 
Pro 


AACS 
Asn 


.GAA 
>Slu 


AAG 
Lys 


AGA 
Arg 


GAT 
Asp 


CAC 
His 


ATG 
Met 


ATC 
He 


CTT 
Leu 


CTT 
Leu 


GAG 
Glu 


TTT 
Phe 


GTA 
Val 


ACA 
Thr 


GCT 
Ala 


GCT 
Ala 


GGG 
Gly 


AT^E 
Ile^ 


ACA 
VThr 


CAT 
His 


GGC 
Gly 


ATG 
Met 


GAT 
Asp 


GAA 
Glu 


CTA 
Leu 


TAC 
Tyr 


AAA 
Lys 



TAA ATGTCCAGAC TTCCAATTGA CACJ 



\GGG ATCCGAATTC 



337 



385 



433 



481 



529 



577 



625 



673 



721 



764 




(2) INFORMATION FOR SEQ ID NO: 20 

(i) SEQUENCE CHARACTERISTIC 

(A) LENGTH: 238 amino aci>s^s 

(B) . TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID N5 

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly 
5 10 

Leu Val Glu Leu Asp Gly Asp Val Asn Gly 
15 20 

Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly 
35 40 

He Cys Thr Thr Gly Lys Leu Pro Val Pro 
50 55 

Thr Leu Thr Tyr Gly Val Gin Cys Phe Ser 
. 65 70 

Lys Gin His Asp Phe Phe Lys Ser Ala Met 
80 85 

Glu Arg Thr He Phe Tyr Lys Asp Asp Gly 
95 100 

Glu Val Lys Phe Glu Gly Asp Thr Leu Val 
115 120 

Gly He Asp Phe Lys Glu Asp Gly Asn He 
130 135 

Tyr Asn Tyr Asn Ser His Asn Val Tyr He 
145 150 

Asn Gly He Lys Val Asn Phe Lys He Arg 
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Vkl Val Pro He 



Gin \ys 
25 



Phe Ser Val Ser 
30 



Thr Leu Lys Phe 
45 



Lys Le 
Tr-p Pro 

Arg Tyr Pro \^sp 

Gly TyV Val Gin 



Leu Val Thr 
60 

Pro Vsp His Met 
75 



Pro Glu 
90 

Asn Tyr 
105 



Lys Thr ^g Ala 
110 



He Glu Leu Y^ys 
125 



Asn Arg 
Leu Gly 
Met Ala 
His Asn He Lys Asp Gly 



His Lys Met Gl\ 
140 

Asp Lys Pro Lys 
155 




160 



165 



170 



Ser Val Gin Leu Al^>sAsp His Tyr Gin Gin Asn Thr Pro lie Gly Asp 

175 rag i85 i90 

Gly Pro Val Leu Leu Pro As^^^n His Tyr Leu Ser Thr Gin Ser Ala 

195 200 205 



Leu Ser Lys Asp Pro Asn Glu Lys Arg>\sp 
210 215 



His Met lie Leu Leu Glu 
220 



Phe Val Thr Ala Ala Gly lie Thr His Gly Mee- 
225 230 



Glu Leu Tyr Lys 
235 



